منابع مشابه
Recognition of continuous speech using neural nets and expert system
A system for recognising continuously spoken sentences is presented. The system has a vocabulary of approx. 35 words and a granunar specifying a few thousand sentences. The system operates in three stages. In the frrst stage, cepstrum vectors are computed in real time and used as input to a self organised neural network. The output of the network is mapped to a continuous valued acoustic phonet...
متن کاملA non-expert Kaldi recipe for Vietnamese Speech Recognition System
In this paper we describe a non-expert setup for Vietnamese speech recognition system using Kaldi toolkit. We collected a speech corpus over fifteen hours from about fifty Vietnamese native speakers and using it to test the feasibility of our setup. The essential linguistic components for the Automatic Speech Recognition (ASR) system was prepared basing on the written form of the language inste...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملSpeech recognition using EMG; mime speech recognition
The cellular phone offers significant benefits but causes several social problems. One such problem is phone use in places where people should not speak, such as trains and libraries. A communication style that would not require voiced speech has the potential to solve this problem. Speech recognition based on electromyography (EMG), which we call "Mime Speech Recognition" is proposed. It not o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of the Acoustical Society of America
سال: 1988
ISSN: 0001-4966
DOI: 10.1121/1.2025950